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1. Breakdown, equivariance and invariance. The authors are to be con- 
gratulated for their excellent paper, which nicely clarifies the role of equiv- 
ariance in finding upper bounds for the breakdown points of functionals. The 
breakdown point approach, with upper bounds showing how far one can go, 
has achieved great success in the univariate and multivariate location, scale, 
scatter and regression estimation problems. The authors justifiably argue 
that this is due to the fact that the acceptable, well-behaved estimates in 
these contexts have natural equivariance properties. In constructing reason- 
able estimates and test statistics, one therefore considers statistics satisfying 
certain conditions (invariance, equivariance, unbiasedness, consistency, etc.). 
If there are no restrictions, the upper bound is one as the breakdown point 
(using the common definition) of a "stupid" constant functional, for exam- 
ple, is one. 

The paper is clearly written with several illustrative examples. The con- 
structive proof of the main Theorem 3.1 illustrates how one can concretely 
break down an equivariant estimate: 

1. Pick a transformation g corresponding to the set with the supremum 
probability mass in (3.3). 

2. Apply the transformation g or g -1 repeatedly to contaminate a (random) 
half of the data outside the set with the supremum probability mass. 

In the one-sample location problem with sample size n = 2k, for example, 
the translation equivariance of a location estimate T(x±, . . . , x n ) means that 

T(xi + c, . . . ,x k + c,x k+1 , ...,x n )- T(xi, . . .,x k ,x k+ i -c,...,x n -c) = c 

and, consequently, the estimate can be broken either by repeatedly shifting 
the first half of the data by +c or by repeatedly shifting the second half of 
the data by — c. 
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The theory thus yields upper bounds for the breakdown points of the 
affine equivariant univariate location and scale functionals but does not say 
anything about affine invariant skewness and kurtosis statistics, for exam- 
ple. Clearly the invariant classical skewness statistic 
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does not break down with one outlying observation, although affine equiv- 
ariant second and third central moments both do. For a single outlier going 
to infinity, the central moments move "beyond all bounds" but b\ converges 
to a constant (n — 2) 2 / (n — 1) . As this limit is not data dependent at all, the 
contaminated statistic b\ does not convey any information on the original 
data points. Is this a breakdown? Another strange example is the estimation 
problem for the parameters of the linear predictor 9q + @' x i n the generalized 
linear model. Again, for n = 2k, an equivariant estimate of satisfies 
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and the estimate can be moved beyond all bounds or to zero by repeat- 
edly multiplying half of the data by c or by 1/c. The estimate then seems 
to become uninformative. Is something wrong with the definitions of the 
breakdown and the breakdown point? What do we really mean when we say 
that a breakdown occurs? 

2. When does the breakdown occur? Since the early notions by Ham- 
pel (1971), the concept of breakdown point has been widely discussed and 
further developed by several contributors. For considering and comparing 
different approaches we adopt the following notation. Let X = {x\, . . . ,x n ) 
be an original "true" sample of size n lying in the sample space X. The 
statistic ("estimate") considered is denoted by T(X) with possible values in 
T = {T(X) : X G X} C W. We say that a point t is interior to T if it belongs 
to T and there is a neighborhood of t which contains only points of T. A 
point t € MP is exterior to T if it does not belong to T, and if there exists 
a neighborhood of t which contains no points of T. Finally, t is called a 
boundary point of T if t is neither interior nor exterior to T. Often T = MP 
and then there are no boundary points. 

We next construct a contaminated sample. Let S = (s±, . . . , s n ) be a vector 
of zeros and ones indicating the contamination, and Y = (yi, . . . , y n ), also 
in X, a sample of "outliers." The contaminated sample then consists of the 
observations (1 — Si)Xi + siyi, i = 1, . . . ,n. The number of outlying or alien 
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observations is accordingly s = s i ■ The contaminated value of the estimate 
is then 

T(X,Y,S) = T((l - si)xi + siyi, . .., (1 - s n )x n + s n y n ). 
The breakdown of the estimate is most often defined as follows. 

Definition 1. T breaks down at X with s outliers if there exist a 
sequence (Y m ) in X and S with J2 s i = s such that ||T(X, Y m , S)\\ — » oo. 

The breakdown point then gives the smallest fraction of outliers (s/n) 
that suffices to " drive the estimate beyond all bounds." According to this 
definition, a constant estimate (T(X) = to) can never be broken down. Note 
also that the scale estimate and scatter matrix estimate are usually thought 
to break down also if they converge to a boundary point of T (scale estimate 
converges to zero and the smallest eigenvalue of the scatter matrix estimate 
converges to zero). In the paper this is taken care of with a suitably chosen 
pseudometric; see, for example, Section 4.2. Another possibility is to give a 
new definition: 

Definition 2. T breaks down at X with s outliers if there exist a 
sequence (Y m ) in X and S with J2 s i = s such that either (i) ||T(X, Y m , S) \\ — ► 
oo or (ii) T(X,Y m , S) —> to where to is a boundary point of T. 

If the constant functional T(X) = to does not depend on X, T = {to} 
and to is also a boundary point. Therefore T breaks down for all S. Note 
that the boundary point to i n the definition may depend on X, however. 
In the simple regression example suggested by the referee and analyzed in 
Section 6, the statistic T(P n ) has values in T = [—n,n] and it breaks down 
(in the sense of Definition 2) if Yl s i = 1- 

Genton and Lucas (2003) take a different viewpoint and argue that a 
crucial property of an estimator T(X, Y, S) is that it takes different values 
for different values of X € X and that the breakdown occurs if this property 
is lost. In this spirit one can say that: 

Definition 3. T breaks down with s outliers if there exist a sequence (Y m ) 
in X and S with ^2si = s such that either (i) ||T(X, 1^,5)11 — > oo, for all 
X £ X, or (ii) T(X, Y m , S) -» t G R p , for all X £ X. 

In this definition, it is remarkable that the interior or boundary point to 
is not allowed to depend on X at all. This definition solves the problem 
with the classical skewness statistic; b\ can be made to break down with a 
single extreme outlier. I wonder whether the techniques and results in the 
paper by Davies and Gather could be expanded to cover this definition also. 
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Genton and Lucas (2003) seem in fact to be still more permissive and say 
that T(X, Y, S) breaks down if 

T n jlimT(X, Y m , S) :X e # J 

collapses to a finite set; an empty set and a singleton {to} are then special 
cases. Given a continuum of values of X, one expects a continuum of pos- 
sible values of the estimate. In the linear predictor estimation problem this 
definition implies that the breakdown point of an equivariant estimate of 6 
is at most one half. 

All the approaches described above work with the worst possible scenario 
represented by a strategically chosen sequence of the sets of outlying obser- 
vations (Y m ). In practice, the observed contaminated value of the estimate 
T(X,Y, S) is in T, however, and not a boundary point, and one can ask 
whether the estimate still conveys useful information about the true data 
cloud or not. Then, instead of speculating about the sequences (Y m ), one 
may consider the set of possible values of T(X, Y, S) for all choices of Y £ X. 
With (at most) s outliers, the set of possible values of T(X,Y,S) is 

%(X) := {T(X, Y,S):YeX,J2s l = s}. 

Then clearly 

{T(X)} = %{X) C 71 (X) C T 2 (X) C-cT 

and the value of the estimate is totally determined by s outliers if %(X) = T. 
More generally, we can define that: 

Definition 4. T breaks down with s outliers if the set T S (X) does not 
depend on X. 

Note that if T is afline equivariant /invariant, then also T S (X) is affine 
equivariant /invariant. Assume next that the observed value of T(X,Y,S) 
is t. If we knew the maximum number of outliers in the data set, say s, but 
S and Y are unknown, the observed event T S (X) 3 t may still be informative. 
In the univariate location case with n = 2k — 1 and X = M n , T S (X) = R for 
the sample mean if s > 0. But for X = [0, oo) n , for example, the breakdown 
point of the mean is one as T S -\{X) 3 t <J=^> xn-\ < n ■ t. For the sample 
median, the event 

T s (X)3t a;(fc- s ) <t <Z(fc +s ), s = 0,...,k, 

is clearly data dependent and therefore carries information about the data 
cloud. 
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